Protein side-chain placement: probabilistic inference and integer programming methods

نویسندگان

  • Eun-Jong Hong
  • Tomás Lozano-Pérez
چکیده

The prediction of energetically favorable sidechain conformations is a fundamental element in homology modeling of proteins and the design of novel protein sequences. The space of side-chain conformations can be approximated by a discrete space of probabilistically representative side-chain conformations (called rotamers). The problem is, then, to find a rotamer selection for each amino acid that minimizes a potential energy function. This is called the Global Minimum Energy Conformation (GMEC) problem. This problem is an NP -hard optimization problem. The Dead-End Elimination theorem together with the A∗ algorithm (DEE/A∗) has been successfully applied to this problem. However, DEE fails to converge for some complex instances. In this paper, we explore two alternatives to DEE/A∗ in solving the GMEC problem. We use a probabilistic inference method, the max-product (MP) belief-propagation algorithm, to estimate (often exactly) the GMEC. We also investigate integer programming formulations to obtain the exact solution. There are known ILP formulations that can be directly applied to the GMEC problem. We review these formulations and compare their effectiveness using CPLEX optimizers. We also present preliminary work towards applying the branch-and-price approach to the GMEC problem. The preliminary results suggest that the max-product algorithm is very effective for the GMEC problem. Though the max-product algorithm is an approximate method, its speed and accuracy are comparable to those of DEE/A∗ in large side-chain placement problems and may be superior

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximate Inference in Graphical Models using LP Relaxations

Graphical models such as Markov random fields have been successfully applied to a wide variety of fields, from computer vision and natural language processing, to computational biology. Exact probabilistic inference is generally intractable in complex models having many dependencies between the variables. We present new approaches to approximate inference based on linear programming (LP) relaxa...

متن کامل

Mathematical Programming Methods for Reasoning under Uncertainty

We survey three applications of mathematical programming to rea soning under uncertainty a an application of linear programming to probabilistic logic b an application of nonlinear programming to Bayesian logic a combination of Bayesian inference with probabilistic logic and c an application of integer programming to Dempster Shafer theory which is a method of combining evidence from di erent s...

متن کامل

Side Chain-Positioning as an Integer Programming Problem

An important aspect of homology modeling and protein design algorithms is the correct positioning of protein side chains on a fixed backbone. Homology modeling methods are necessary to complement large scale structural genomics projects. Recently it has been shown that in automatic protein design it is of the uttermost importance to find the global solution to the side chain positioning problem...

متن کامل

A multi objective mixed integer programming model for design of a sustainable meat supply chain network

In the recent decades, rapid population growth has led to the significant increase in food demand. Food supply chain has always been one of the most important and challenging management issues. Product with short age, especially foodstuffs, is the most problematic challenges for supply chain management. These challenges are mainly due to the diversity in the number of these goods, the special n...

متن کامل

Global Supply Chain Management under Carbon Emission Trading Program Using Mixed Integer Programming and Genetic Algorithm

In this paper, the transportation problem under the carbon emission trading program ismodelled by mathematical programming and genetic algorithm. Since green supply chain issuesbecome important and new legislations are taken into account, carbon emissions costs are included inthe total costs of the supply chain. The optimisation model has the ability to minimise the total costsand provides the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003